Search CORE

15 research outputs found

Revealing the True Cost of Local Privacy: An Auditing Perspective

Author: Arcolezi Héber H.
Gambs Sébastien
Publication venue
Publication date: 04/09/2023
Field of study

This paper introduces the LDP-Auditor framework for empirically estimating the privacy loss of Locally Differentially Private (LDP) mechanisms. Several factors influencing the privacy audit are explored, such as the impact of different encoding and perturbation functions of eight state-of-the-art LDP protocols. Furthermore, the influence of domain size as well as the theoretical privacy loss parameter

\epsilon

on local privacy estimation are also examined. Overall, our LDP-Auditor framework and findings offer valuable insights into the sources of randomness and information loss in LDP protocols, contributing to a more realistic understanding of the local privacy loss. Furthermore, we demonstrate the effectiveness of LDP-Auditor by successfully identifying a bug in an LDP library.Comment: Accepted for poster presentation at TPD

arXiv.org e-Print Archive

On the Risks of Collecting Multidimensional Data Under Local Differential Privacy

Author: Arcolezi Héber H.
Couchot Jean-François
Gambs Sébastien
Palamidessi Catuscia
Publication venue
Publication date: 28/12/2022
Field of study

The private collection of multiple statistics from a population is a fundamental statistical problem. One possible approach to realize this is to rely on the local model of differential privacy (LDP). Numerous LDP protocols have been developed for the task of frequency estimation of single and multiple attributes. These studies mainly focused on improving the utility of the algorithms to ensure the server performs the estimations accurately. In this paper, we investigate privacy threats (re-identification and attribute inference attacks) against LDP protocols for multidimensional data following two state-of-the-art solutions for frequency estimation of multiple attributes. To broaden the scope of our study, we have also experimentally assessed five widely used LDP protocols, namely, generalized randomized response, optimal local hashing, subset selection, RAPPOR and optimal unary encoding. Finally, we also proposed a countermeasure that improves both utility and robustness against the identified threats. Our contributions can help practitioners aiming to collect users' statistics privately to decide which LDP mechanism best fits their needs.Comment: Accepted at VLDB 202

arXiv.org e-Print Archive

HAL - Université de Franche-Comté

INRIA a CCSD electronic archive server

HAL-Polytechnique

Causal Discovery Under Local Privacy

Author: Arcolezi Héber H.
Binkytė Rūta
Jung Kangsoo
Lestyán Szilvia
Palamidessi Catuscia
Pinzón Carlos
Publication venue
Publication date: 15/11/2023
Field of study

Differential privacy is a widely adopted framework designed to safeguard the sensitive information of data providers within a data set. It is based on the application of controlled noise at the interface between the server that stores and processes the data, and the data consumers. Local differential privacy is a variant that allows data providers to apply the privatization mechanism themselves on their data individually. Therefore it provides protection also in contexts in which the server, or even the data collector, cannot be trusted. The introduction of noise, however, inevitably affects the utility of the data, particularly by distorting the correlations between individual data components. This distortion can prove detrimental to tasks such as causal discovery. In this paper, we consider various well-known locally differentially private mechanisms and compare the trade-off between the privacy they provide, and the accuracy of the causal structure produced by algorithms for causal learning when applied to data obfuscated by these mechanisms. Our analysis yields valuable insights for selecting appropriate local differentially private protocols for causal discovery tasks. We foresee that our findings will aid researchers and practitioners in conducting locally private causal discovery

arXiv.org e-Print Archive

Frequency Estimation of Evolving Data Under Local Differential Privacy

Author: Arcolezi Héber H.
Gambs Sébastien
Palamidessi Catuscia
Pinzón Carlos
Publication venue: HAL CCSD
Publication date: 23/12/2022
Field of study

under reviewCollecting and analyzing evolving longitudinal data has become a common practice. One possible approach to protect the users' privacy in this context is to use local differential privacy (LDP) protocols, which ensure the privacy protection of all users even in the case of a breach or data misuse. Existing LDP data collection protocols such as Google's RAPPOR and Microsoft's dBitFlipPM have longitudinal privacy linear to the domain size k, which can be excessive for large domains, such as Internet domains. To solve this issue, in this paper we introduce a new LDP data collection protocol for longitudinal frequency monitoring named LOngitudinal LOcal HAshing (LOLOHA) with formal privacy guarantees. In addition, the privacy-utility trade-off of our protocol is only linear with respect to a reduced domain size 2<=g<<k. LOLOHA combines a domain reduction approach via local hashing with double randomization to minimize the privacy leakage incurred by data updates. As demonstrated by our theoretical analysis as well as our experimental evaluation, LOLOHA achieves a utility competitive to current state-of-the-art protocols, while substantially minimizing the longitudinal privacy budget consumption by up to k/g orders of magnitude

INRIA a CCSD electronic archive server

HAL-Polytechnique

Frequency Estimation of Evolving Data Under Local Differential Privacy

Author: Gambs Sébastien
H. Arcolezi Héber
Palamidessi Catuscia
Pinzón Carlos
Publication venue: HAL CCSD
Publication date: 23/12/2022
Field of study

INRIA a CCSD electronic archive server

Improving the utility of locally differentially private protocols for longitudinal and multidimensional frequency estimates

Author: Al Bouna Bechara
Arcolezi Héber H.
Couchot Jean-François
Xiao Xiaokui
Publication venue: 'Elsevier BV'
Publication date: 01/07/2022
Field of study

International audienceThis paper investigates the problem of collecting multidimensional data throughout time (i.e., longitudinal studies) for the fundamental task of frequency estimation under Local Differential Privacy (LDP) guarantees. Contrary to frequency estimation of a single attribute, the multidimensional aspect demands particular attention to the privacy budget. Besides, when collecting user statistics longitudinally, privacy progressively degrades. Indeed, the "multiple" settings in combination (i.e., many attributes and several collections throughout time) impose several challenges, for which this paper proposes the first solution for frequency estimates under LDP. To tackle these issues, we extend the analysis of three state-of-the-art LDP protocols (Generalized Randomized Response-GRR, Optimized Unary Encoding-OUE, and Symmetric Unary Encoding-SUE) for both longitudinal and multidimensional data collections. While the known literature uses OUE and SUE for two rounds of sanitization (a.k.a. memoization), i.e., L-OUE and L-SUE, respectively, we analytically and experimentally show that starting with OUE and then with SUE provides higher data utility (i.e., L-OSUE). Also, for attributes with small domain sizes, we propose Longitudinal GRR (L-GRR), which provides higher utility than the other protocols based on unary encoding. Last, we also propose a new solution named Adaptive LDP for LOngitudinal and Multidimensional FREquency Estimates (ALLOMFREE), which randomly samples a single attribute to be sent with the whole privacy budget and adaptively selects the optimal protocol, i.e., either L-GRR or L-OSUE. As shown in the results, ALLOMFREE consistently and considerably outperforms the state-of-the-art L-SUE and L-OUE protocols in the quality of the frequency estimates

arXiv.org e-Print Archive

HAL - Université de Franche-Comté

INRIA a CCSD electronic archive server

HAL Descartes

HAL-Polytechnique

On the Impact of Multi-dimensional Local Differential Privacy on Fairness

Author: Arcolezi Héber, H
Brahim Ghassen, Ben
Makhlouf Karima
Palamidessi Catuscia
Zhioua Sami
Publication venue: HAL CCSD
Publication date: 03/09/2024
Field of study

International audienceAutomated decision systems are increasingly used to make consequential decisions on people's lives. Due to the sensitivity of the manipulated data as well as the resulting decisions, several ethical concerns need to be addressed for the appropriate use of such technologies, in particular, fairness and privacy. Unlike previous work which focused on centralized differential privacy (DP) or on local DP (LDP) for a single sensitive attribute, in this paper, we examine the impact of LDP in the presence of several sensitive attributes (i.e., multi-dimensional data) on fairness. Detailed empirical analysis on synthetic and benchmark datasets revealed very relevant observations. In particular, (1) multi-dimensional LDP is an efficient approach to reduce disparity, (2) the multi-dimensional approach of LDP (independent vs combined) matters only at low privacy guarantees (high ϵ), and (3) the outcome Y distribution has an important effect on which group is more sensitive to the obfuscation. Last, we summarize our findings in the form of recommendations to guide practitioners in adopting effective privacy-preserving practices while maintaining fairness and utility in ML applications

HAL-Polytechnique

Improving the utility of locally differentially private protocols for longitudinal and multidimensional frequency estimates

Author: Al Bouna Bechara
Arcolezi Héber H.
Couchot Jean-François
Xiao Xiaokui
Publication venue: 'Elsevier BV'
Publication date: 01/07/2022
Field of study

HAL - Université de Franche-Comté

HAL-Polytechnique